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NONPARAMETRIC INFERENCE OF QUANTILE CURVES FOR 
NONSTATIONARY TIME SERIES^ 

By Zhou Zhou 

University of Toronto 

The paper considers nonparametric specification tests of quantile 
curves for a general class of nonstationary processes. Using Bahadur 
representation and Gaussian approximation results for nonstationary 
time series, simultaneous confidence bands and integrated squared 
difference tests are proposed to test various parametric forms of the 
quantile curves with asymptotically correct type I error rates. A wild 
bootstrap procedure is implemented to alleviate the problem of slow 
convergence of the asymptotic results. In particular, our results can 
be used to test the trends of extremes of climate variables, an im- 
portant problem in understanding climate change. Our methodology 
is applied to the analysis of the maximum speed of tropical cyclone 
winds. It was found that an inhomogeneous upward trend for cyclone 
wind speeds is pronounced at high quantile values. However, there is 
no trend in the mean lifetime-maximum wind speed. This example 
shows the effectiveness of the quantile regression technique. 

1. Introduction. After fitting a nonparametric model, one often asks 
whether it can be simplified into certain parametric, semiparametric or more 
parsimonious nonparametric forms. Recently, there has been an enormous 
interest in developing nonparametric specification tests; see, for example. 
Hall and Titterington (1988), Eubank and Speckman (1993), Hardle and 
Mammen (1993), Ingster (1993), Zheng (1996), Hart (1997), Stute (1997), 
Xia (1998), Horowitz and Spokoiny (2001), Fan, Zhang and Zhang (2001) 
and Fan and Jiang (2007) among others. Many of the previous results con- 
cern nonparametric inference of the (conditional) mean or density functions 
for independent data. 
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The primary goal of this paper is to perform nonparametric specification 
tests of quantile curves for a class of nonstationary processes that can be 
called locally stationary processes [Draghicescu, Guillas and Wu (2009) and 
Zhou and Wu (2009)]. Conceptually, the local stationarity is characterized 
by the smoothly time-varying data generating mechanisms of the processes. 
More precisely, let {Xi^n}^=i be the observed sequence. We shall adopt the 
following formulation: 

(1) Xi^n = G{i/n,Ti), z = l,2,...,n, 

where Ti = (. . . , ej_i, e^), e^, i £ Z, are independent and identically dis- 
tributed (i.i.d.) random variables, and G : [0, 1] x M°° i— >■ M is a measurable 
function such that Ci{t) •= G{t,J^i) is a properly defined random variable for 
all t G [0, 1]. Here, local stationarity means that the random function Ci{t) is 
smooth in t G [0, 1] in an appropriate sense. Process (1) covers a wide range of 
nonstationary linear and nonstationary nonlinear processes and it naturally 
extends many existing stationary time series models into the nonstationary 
setting. See Zhou and Wu (2009) for more discussion and examples. In the 
sequel, for notational convenience, we shall write Xi^n as Xi. 

The paper is motivated by the important problem of understanding the 
trends of extremes and variability of climate variables. As stated in Katz and 
Brown (1992), ^^understanding climate change demands attention to changes 
in climate variability and extremes." By interpreting climate extremes as up- 
per and lower quantiles and climate variability as interpercentile ranges, the 
nonparametric quantile estimation [Koenker (2005)] provides a simple and 
effective means to address the latter problem. On the other hand, climatol- 
ogists often want to know whether a simple linear or quadratic function is 
appropriate to describe the trends. The parametric description of the trends 
is more interpretable and efficient than its nonparametric counterparts if 
the parametric model is correctly specified. To address the latter issue, it 
is necessary to develop nonparametric specification tests for the quantile 
curves. 

For independent data, there have been a few results on nonparametric 
specification tests of quantile curves. He and Zhu (2003) and Kim (2007) 
proposed tests based on the cusum process of the residuals and the Rao- 
score statistics, respectively. The tests use fitted values and residuals under 
the null hypothesis (namely the parametric model) and therefore enjoy the 
advantage of avoiding the need for nonparametric function estimation. On 
the other hand, however, the above tests are sensitive only to a restrictively 
small class of alternatives; namely alternatives in the neighborhood of the 
parametric null. An alternative specification test which achieves minimax 
rate over a large class of smooth functions is proposed in Horowitz and 
Spokoiny (2002). It seems that the test is tailored to the specific problem 
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of testing linearity. For other contributions, see Zheng (1998), Rosenkrantz 
(2000) and Wang (2007, 2008). 

To our knowledge, testing parametric functional forms of quantile curves 
under the time series setting has not yet been considered in the literature. 
In this paper, we shall conduct the specification tests by directly comparing 
the nonparametric quantile estimates and the parametric null. In principle, 
if some global measure such as the C°° norm or the norm is large for 
the difference between the nonparametric fits and the parametric null, then 
there is evidence against the parametric null hypothesis. This approach is 
intuitively plausible and it is easy to understand. For nonparametric infer- 
ence of the mean functions, this very idea has been widely applied; see, for 
example, Eubank and Speckman (1993) and Hardle and Mammen (1993) 
among others. 

Nevertheless, obtaining the asymptotic distributions of appropriately nor- 
malized global measures of the deviations of the quantile curves has been a 
very difficult problem when dependence is present. Generally speaking, the 
latter problem can be solved if we have (i) unform Bahadur representations 
of the estimated quantile curves and (ii) a sharp Gaussian approximation 
result for the weighted empirical processes of the nonstationary time series 
{^i)^=i- Recently, Zhou and Wu (2009) obtained a sharp uniform Bahadur 
representation for the local linear quantile estimates of locally stationary 
time series. On the other hand, Wu and Zhou (2010) established Gaussian 
approximation results for partial sums of nonstationary time series with 
nearly optimal rates. An extension of the latter result addresses the above 
issue (ii); see Theorem 4 below. 

With the recent progress in Bahadur representations and Gaussian ap- 
proximations for locally stationary time series, we are able to construct in 
this paper simultaneous confidence bands (SCB) (based on the norm) 
and integrated squared difference tests (ISDT) (based on the £^ norm) for 
the quantile curves as tools for the nonparametric inference. The SCB is 
shown to asymptotically achieve the correct coverage probability. Moreover, 
we prove that the ISDT asymptotically attain the correct type I error rate 
and are asymptotically optimal in terms of rates of convergence for non- 
parametric hypothesis testing in the sense of Ingster (1993). Our results 
shed new light on nonparametric specification tests of M-type estimates for 
nonstationary time series. 

The rest of the paper is structured as follows. Section 2 introduces the lo- 
cal linear quantile estimates and the dependence measures. Section 3 presents 
the asymptotic results on the nonparametric specification tests. In particu- 
lar, asymptotic results on the SCB and ISDT are presented in Sections 3.2 
and 3.3, respectively. A wild bootstrap procedure is introduced in Section 4 
to address the issue of slow convergence of the SCB and ISDT tests. Section 
4 also contains discussions on bandwidth selection and nuisance parameter 
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estimation. Section 5 presents two simulation studies to compare accuracy 
and sensitivity of various nonparametric tests. The global tropical cyclone 
data is studied in Section 6. Proofs are given in Section 7. 



2. Preliminaries. We now introduce some notation. For a vector v = 
{vi,V2, . ■ ■ ,Vp) £ W, let |v| = (XliLi vfy^"^- For apxp matrix A, define \A\ = 
sup{|Av| : |v| = 1}. For a random vector V, write V S (g' > 0) if ||V||q : = 
[E(| Vj'^)]^/'^ < oo and ||V|| = ||V||2. Denote by =^ the weak convergence. For 
an interval X C M, denote by i S N, the collection of functions that 
have ith. order continuous derivatives on I, and, for V CM'^, let CD be 
the collection of real-valued functions that are continuous on D. A function 
/ : M*^ — )• M is said Lipschitz continuous on P C M'^ if there exists a finite 
constant C, such that \f{xi) — f{x2)\ < C\xi — X2\ for all xi,X2 G T>. For 
X G M, define x~^ = max(rE, 0). The symbol C denotes a finite generic constant 
which may vary from line to line. 



2.1. Local linear quantile estimator. Recall Qit) = G{t,J^i). Let F(t,x) = 
IP(Ci(^) ^ x), X G M, be the cumulative distribution function (cdf) of CiC*); 
t G [0,1]; let Qa{t) be the ath quantile function of 0(^)7 G (0,1), namely 
Qa{t) =\nix{F{t.,x) > a}. Suppose Qa{t) is smooth on [0,1]. As Qa{ti) ~ 
Qa{t) + (^1 — OQoW for ti close to t, Qaif) and (^^(i) can be estimated by 
the local linear approach [Koenker (2005)] 

n 

{Qa,bAt),Q'aAM) = argmin V/j,(Xi - /3o - /3i(ti - t))Kb„iti - t), 

where ti = i/n, pa{x) = ax^ + (1 — «)(— x)~^ is the check function. Here K 
is a kernel function, K}j^{-) = K{-/bn) and bn = bn{a) > is the bandwidth 
depending on a. We shall omit the subscript 6„ in Q and Q' hereafter if no 
confusion will be caused. 



2.2. The dependence measures. It is shown [Condition (Bl) of Zhou and 
Wu (2009)] that the asymptotic behavior of the local linear quantile estima- 
tor is determined by the dependence structure of 

a^T|G(i, J"i+i) < 
Fk{t,x,T,):= dxk ' A; = 0,1,2,3. 

To quantify the dependence structure of the above processes, let us con- 
sider a generic nonlinear system {/^(t, x, J^j)}^^^, where {t,x) G [0,1] x M 
and : [0, 1] x M x R°° i-> M is a measurable function such that H{t,x,Fi) 
is well defined for all {t,x) G [0,1] x M. Let e^, i G Z, be an i.i.d. copy of 
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Ej, j € Z. For A; > 0, let J^^ = (J-"-!, Eq, ei, . . . , e^) and define the physical 
dependence measure 

(2) 6Hik,p)= sup \\H{t,x,Tk)-Hit,x,T;)\\p. 

{t,x)&[0,l]xR 

Here we recall || • ||p = [E(| • |p)]Vp. Note that SH{k,p) measures the overall 
dependence of H(t,x,Tk) on the input Eq. The physical dependence mea- 
sures are by their definition closely related to the data generating mechanism 
and hence are easy to work with; see Section 4 of Zhou and Wu (2009) for 
the related calculations for locally linear and nonlinear time series models. 

We shall call the system {H{-, ■,J^i)}i^z uniformly geometric moment con- 
tracting of order p [UGMC(p)] if 6H{k,p) decays exponentially with respect 
to k; namely 

(3) fe(fc,p) = 0(/), 0<x<l. 

Following Section 4 of Zhou and Wu (2009), condition (3) is readily verifi- 
able for a large class of nonstationary nonlinear processes and nonstationary 
linear models. All our results will be presented in terms of the physical de- 
pendence measures and the UGMC conditions. 

Note that UGMC (2) is stronger than the stability condition of Zhou and 
Wu (2009). Analogous results of this paper can be proved with Suik^p) de- 
caying algebraically at a sufficiently fast rate. However, the technical details 
are much lengthier and we chose to use the UGMC condition for clarity of 
presentation. 

3. Main results. 

3.1. Assumptions. We shall make the following assumptions on the pro- 
cess {Xi) and the kernel K. 

(Al) /(t, x) is Lipschitz continuous on [0, 1] x M, where /(t, •) is the density 

function of Qiif). Assume inf^gp,!] fitiQa{t)) > 0. 
(A2) Let J{t,x,Ti) = I{G{t,Ti) < x} and 

oo 

(4) a\t)= J2 cov[J{t,Qa{t),To),J{t,Qa{t),Ti)]. 

i=— oo 

Assume a{t) is Lipschitz continuous on [0, 1] and inf(g[o j] a{t) > 0. 
(A3) (Stochastic Lipschitz continuity condition.) There exists q > 1, such 

that WCiih) -Ci{t2)\\q<C\ti-t2\ holds for ah ti,t2 S [0,1]. 
(A4) Assume that, for k = 1,2,3, Fk{t,x,Fi) := F{t,x,Fi) / dx^ exists and 

YlkLo^F^ikA) < oo, for i = 1,2,3. 
(A5) F{t,x,Ti) is UGMC(4). 
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(A6) There exists Co < oo such that supf^^^^-^^^Q^i^y.^ Fi(t, x , Ti) < Cq almost 
surely. 

(Kl) K{-) £ IC, where fC is the collection of density functions K such that 
K is symmetric with support [—1, 1] and K G 1, 1]. For K{-) G /C 
and i > 1, define 



'^K = j K'^{x)dx, (tK ■ 



J^^\K'{xrdx 



(PK 

Hj^K = J x^K{x) dx. 

We shall omit the subscript K in the sequel if no confusion will be 
caused. 

A few remarks on the above regularity conditions are in order. The func- 
tion o'^(t) in condition (A2) is called the long-run variance function of 
{J(t,(5a(t), to account for the dependence of the series. Note 

that cr^(t) is well defined even for heavy-tailed processes (Xj) since < 
I{G{t,J-i) < x} < 1. Condition (A3) means local stationarity and it asserts 
smoothness of G{t,J^i) with respect to time t. Conditions (A4) and (A5) 
assert that processes {Fk{t,x,Ti)} , A; = 0,1,2,3, are short range dependent 
(SRD). Conditions (A3)-(A5) can be verified for a large class of nonstation- 
ary linear and nonlinear processes by the arguments in Section 4 of Zhou 
and Wu (2009). Condition (A6) is mild and it means that the conditional 
density function of J{t,x,J-'i) is bounded. A popular choice of the kernel 
function is the Epanechnikov kernel K{x) = 3max(0, 1 — 2;^)/4. 

3.2. Simultaneous confidence bands. The simultaneous confidence band 
(SCB) is a classic tool for nonparametric inference. To construct a 100(1 — 
/3)% SCB for Q„(-), one finds two functions / and u depending on (^j)"—]^, 
such that 

lim ¥{l{t) < Qa{t) < u{t) for all t G (0, 1)) = 1 - /3. 

n— >-oo 

A candidate function for Qa{') is rejected at level /3 if it is not fully con- 
tained in the SCB. The SCB provides appreciable direct visual information 
on the overall variability of the fitted curves. See, for example, Bickel and 
Rosenblatt (1973) for the inference of density functions; Eubank and Speck- 
man (1993), Sun and Loader (1994), Neumann and Kreiss (1998), Wu and 
Zhao (2007) and Zhao and Wu (2008) for the inference of (conditional) 
mean functions and Fan and Zhang (2000) for the inference of coefficient 
functions of varying coefficient models. In this section, we shall establish the 
asymptotic theory for the maximal absolute deviation of Qait) from Qa{t) 
on (0,1). The theoretical results facilitate construction of a SCB of Qait) 
which asymptotically achieves the nominal coverage probability. 
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Theorem 1. Assume G C^[0, 1] and conditions (Al)-(A6) and 

(Kl) hold. Further assume T/n6„/log^ n — t- oo and nbj^logn ^ 0, then we 
have 



lim I 

n— >oo 



sup 

t&Tn 



(5) 



^nKf{t,Qa{t)) 
^a{t) 

x\Q^{t)-Qa{t)-fi2blQ'i{t)/2\ 
-B{m*)< 



-2e- 



where Tn = [^n, 1 — &n.]i tttl* = Q?^'^ 

5(m*) = (21ogm*)^/2 + (21ogm*)"^/2[loge:- 21og7r - 2 log 2] /2. 

For a fixed level /3, Theorem 1 implies that one can construct a 100(1 
/3)% simultaneous confidence band for Qa{t) 



(6) g«(t)-;U2M(t)/2± 



where = — loglog[(l — /3)~^/^]. By Theorem 1, SCB (6) asymptotically 
achieves the right coverage probability 1 — /3. Note that V(t) :- 



is the asymptotic standard deviation of Qait) [Theorem 1 of Zhou and Wu 
(2009)]. 

The following theorem concerns the local power of the SCB (6). 

Theorem 2. Suppose Qg jt) = Q° (t)+7„?7(t) + o(7n), where Q°^{t),r]{t) G 
C[0, 1], 7„, = \/yJ—2nbn log hn and o{^n) is uniform in t on [0, 1]. Then under 
conditions of Theorem 1, we have 



lim ] 

n— ^oo 



sup 



(7) 



^f{t,Qa{t)) 

^a{t) 

x|Qa(t)-Q°(t)-/i26^g"(t)/2| 

-B{m*)< 



-s{ri)e 



where 



r,{t)f{t,Q^{t)) 



^ait) 



dt. 
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Theorem 2 follows from similar arguments to those in the proof of Theo- 
rem 1 and Theorem Al of Bickel and Rosenblatt (1973). Details are omitted. 

Theorem 2 implies that SCB (6) can detect alternatives with the rate 7^- 
For the 100(1 — /3)% SCB (6), the asymptotic power of the test Hq : Qa(t) = 
Ql,{t) versus Ha : Qa{t) + Q°At) is 

(8) 1 - (1 - /3)"('')/2 

under the local alternatives specified in Theorem 2. Since sij]) > 2 and s{rj) = 
2 if and only if r]{t) = 0, our test based on the SCB is always asymptotically 
unbiased under such alternatives. In the case of density function inference 
for i.i.d. data, the same result was obtained by Bickel and Rosenblatt (1973). 

3.2.1. Optimality of the SCB. If the local linear smoothing technique is 
adopted and the bandwidth series (6„) is fixed, then SCB (6) is optimal in 
the sense that asymptotically it covers the minimum area. To see this, a 
Lagrange multiplier argument can be implemented. A similar argument can 
be found in Zhou and Wu (2010) for nonparametric inference of time- varying 
coefficients in functional linear models. Note by equations (46) and (47) in 
Section 7, we have under conditions of Theorem 1, 

(9) sup I [Qa{t) - Qa{t) - ii2hlQ'^{t)/2]/V{t) - (dn{t)\ = o^{\og-^'^ n), 
taTa 

where 0n(t) = Z^ILi ^«-^fcn ~ t)/\/ 4>nhn with (^4)"=! i.i.d. standard nor- 
mal. In other words, the simultaneous fluctuations of [Qa{t) — Qa{i) — 
H2bnQa{'t)/'^]/^{'t) are asymptotically equivalent to those of @n{t)- Let Si = 
{2i - 1)6„, z = 1, . . . where Qn = [1/(26„)J. From (9), Qa{si) - Qa{si) - 
/i2^n'3a(*j)/2 are asymptotically independent N{Q,V'^{si)). Suppose a band 

l{Si) < Qa{Si) - Qa{Si) - ll2bnQa{Si)/2 < u{si), i = 1,2, . . . , Qn, 

achieves a preassigned coverage probability 1 — f3. From the above discus- 
sion, we see that the coverage probability restriction can be asymptotically 
written as 

c(n, bn) := llmu{si)/V{si)) - ^l{si)/V{si))] = 1-/3, 

i=l 

where ^{■) is the normal cumulative distribution function (cdf). In order to 
achieve the minimum average length, one minimizes the following Lagrange 
multiplier: 

Qn 

(10) Y^Hsi) - lisi)] - A{log[c(n, bn)] - log(l - m- 

i=l 
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Simple calculations show that the minimum is achieved at u{si) = — /(sj) = 
g{n,bn, f3)V{si), where g{n,bn,/3) is a deterministic function. The important 
message here is that the asymptotically optimal SCB at each time point 
t should have length proportional to the asymptotic standard deviation of 
Qa{i), which is the case in our construction. 

3.3. The integrated squared difference test (ISDT). Another popular ba- 
sis for nonparametric inference is £^-distance based tests. In general, one 
calculates a norm related distance between the fitted nonparametric curve 
and the parametric null, and a large distance indicates violation of the null 
hypothesis. Most of the existing results on the £^ type tests are for inde- 
pendent data. See, for instance, Bickel and Rosenblatt (1973), Hardle and 
Mammen (1993), Zheng (1996), Fan, Zhang and Zhang (2001), Zhang and 
Dette (2004) and Fan and Jiang (2005) among others. 

A simple way to construct a type test is to use the statistic T„ = 
/o [Oci(^) ~ Qa{ty\^T^{t) dt. However, the bias of the local linear estimate 
Qa{t) is of order 0(6^ + ■^) and is not negligible for the asymptotic analy- 
sis. The extra bias term complicates the asymptotic distribution and reduces 
the precision of the test r„. See also Hardle and Mammen (1993) for a re- 
lated discussion in the case of conditional mean inference. To overcome the 
disadvantage, we shall use the following jackknife bias reduction technique 



Qa(i) gC2[0,1]. Note that using (11) is equivalent to using the second-order 
kernel 



To test Hq : Qa{t) = Qa{^) versus Ha : Qa{t) / Qai'^), we propose the follow- 
ing test statistic 



where 7^* = [v^ftn, 1 — V^bn] and the weight Tr{t) are assumed to be nonneg- 
ative and Lipschitz continuous in t € [0, 1]. The following theorem establishes 
the asymptotic normality of T*. 

Theorem 3. Assume that Qa(t) gC^[0,1]; that conditions (Al)-(A6) 
and (Kl) hold and that Qa{t) = Q^{t) + Qnr]{t) + o{Qn), where Q°^{t),rj{t) G 



[Wu and Zhao (2007)]: 




K*{x) ■.= 2K{x) - K{x/\/2)/V2. 



(12) 
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C[0,1], 

Qn — ^ bfi and oi^Qn) is unifovni in t on [0, 1]. Fuvthtv assume 
nhl^l \og^^ n — )• oo and nbf/'^ = 0(1) . We have 



n 



(13) 



y^T*-^K*irK*{0) [ 7r*{t)dt- [ if{t)^{t)dt 
ybn Jo Jo 



^n(^0,2 j [K*irK*{t)fdt TT*{tfd?j, 

where -k denotes the convolution operator and TT*{t) = 7r{t)a'^{t)/ f^{t,Qa{t)) . 

When rj{t) = 0, Theorem 3 unveils the asymptotic null distribution of 
T*. The ISDT can detect alternatives with the rate Qn- Under the local 
alternatives specified in Theorem 3, simple calculations based on (13) show 
that the asymptotic power of the ISDT with level /3 equals 

(14) $ JO I y J w 

\[2QK*kK*it)fdtJ,\*itrdt]y^ 

where ^{■) and denote the cumulative distribution function and the 
1 — /3 quantile of the standard normal distribution. Therefore, a simple use 
of the Cauchy-Schwarz inequality shows that choosing weights 7r(t) pro- 
portional to rj^{t)f{t,Qa{t))/a^{t) maximizes the above asymptotic power. 
Of course, in real applications it is difficult to specify r}{t). Therefore, one 
can simply choose 7r(t) = 1. On the other hand, we suggest choosing 7r(t) = 
f{t,Qait))/a'^(t), which leads to an easier implementation of the bootstrap. 
See the discussions in Section 4.1 for more details. 

Corollary 1. Under conditions of Theorem 3, T* can detect alterna- 
tives with departure rate n~^^^ if the bandwidth bn = 0(n~^/^). 

Remark 1. If Xi is distributed as N {fi{ti) , a'^) , and the Xj's are inde- 
pendent, then inference of quantile curves is equivalent to inference of the 
mean function /x(-). In this case, Ingster (1993) and Lepski and Spokoiny 
(1999) proved that the optimal rate for testing Hq : Qa{t) = Qai^) is ri~'^/^. 
Hence, the integrated squared difference test T* is optimal in the sense that 
it achieves the optimal rate of convergence. Furthermore, Corollary 1 implies 
that for nonparametric quantile function testing, weak dependence and local 
stationarity do not deteriorate the optimal rate. However, it should be noted 
that when there is long memory, the rate will be deteriorated. 

Remark 2. Since Qn-, the ISDT test T* dominates the test based 
on SCB (6) if bandwidths of the same order are used for the tests; namely 
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T* is asymptotically more powerful. Therefore a general rule of thumb is to 
use the SCB when one wants to explore the overall pattern of the quantile 
curves and to implement the ISDT test when one is interested in verifying a 
specific parametric null. On the other hand, as stated in Hardle and Mam- 
men (1993), ^^certainly from a more data analytic point of view distances 
would be more satisfactory which reflect similarities in the shape of the re- 
gression functions." For a moderate sample size, intuitively, the ISDT test 
would be more powerful compared to the SCB test if the true quantile curve 
differs from the null in a systematic and even way; while the SCB test is 
better when the latter difference is abrupt or "bumpy," in which case the 
norm does not reflect the characteristics of the difference. Hence, if there is 
some prior knowledge on the shape of the discrepancy, one could select a test 
accordingly. In Section 5, we shall conduct a simulation study to compare 
the powers of the two tests under various alternatives. 

4. Implementation. 

4.1. A wild bootstrap procedure. The asymptotic results in Section 3 are 
based on the uniform Gaussian approximations of Qa{t) — Qa{t) on (0, 1). 
It is known that the convergence rates of the C°° and £^ norms of the 
corresponding Gaussian processes are very slow (see also proofs in Section 7). 
For example, when hn{oi) = 0{n~^/^), the convergence rates of the and 
norms are 1/log^^^n and n~^/^^ , respectively. Therefore, for moderate 
sample sizes, tests based on the asymptotic theory are not reliable. 

For nonparametric inference, the bootstrap is a classic tool for achieving 
faster convergence. It is impossible to have a complete list of literature here 
and we shall only mention several representatives. Among others, Mammen 
(1993), Hardle and Mammen (1993), Chapter 8 of Shao and Tu (1995), 
Stute, Gonzalez Manteiga and Presedo Quindimil (1998), Neumann and 
Kreiss (1998) and Fan and Jiang (2007) considered wild bootstrap inference 
for conditional mean regression for independent data; Jhun (1988), Faraway 
and Jhun (1990) and Hall (1993) considered bootstrap inference of the den- 
sity function for i.i.d. data. For nonparametric inference of dependent data, 
among others, Politis and Romano (1994) proposed a stationary bootstrap 
for simultaneous inference of the spectral density functions of weakly depen- 
dent stationary time series and Wu and Zhao (2007) used a wild bootstrap 
technique to test the mean function under stationary errors. For other con- 
tributions, see Barrio and Matran (2000). 

On the other hand, there is also large literature on bootstrap methods 
for parametric quantile regression. See, for example. Chapter 3.9 of Koenker 
(2005) and references therein for independent data and Fitzenberger (1998) 
for the moving block bootstrap for strong mixing samples. 
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Despite the huge hterature on bootstrap strategies for nonparametric in- 
ference and parametric quantile regression, there have been few results on 
bootstrap methods for nonparametric quantile inference for nonstationary 
time series. The major difficulty lies in the fact that joint distributions of 
subseries within different time spans can be drastically different for a non- 
stationary process; therefore it is challenging to capture the variability of 
the process in every local structure in order to make valid nonparametric 
inferences. 

To circumvent the above difficulties, here we shall adopt a different wild 
bootstrap technique. A similar technique was proposed in Wu and Zhao 
(2007) for nonparametric inference of the mean function under stationary 
errors. The key idea is still uniform Gaussian approximation of Qa{i) — Qa{t) 
on (0,1). However, instead of resorting to asymptotic theory, we shall di- 
rectly simulate the finite sample £^ and C°° norms of the Gaussian pro- 
cesses. More precisely, let us assume the bandwidth hn{(y) = 0{n~^/^) and 
Qa{t) G C3[0, 1] for illustrative purposes. Then by the proofs of Theorems 1 
and 3 in Section 7, we can obtain after elementary calculations that 



(15) sup 



f^^^Q^^))[Q(t)-Q{t)\-Xn{t) 



a{t) 

[ [Qa{t)-Qa{t)fmdt- [ Xnitfdt 
JT' JT' 



Op(7i-ii/20log2n) 



:Op(n^log^/2 



and 
(16) 

where 

n 

3en{t)=J2'^iKliti-t)/{nbn) 
i=l 

with (Vi) i.i.d. standard normal, 7r(f) = P{t,Qa{t))/a'^{t) and /3 = -0.95. 
Note that here the bias-corrected estimator Q(-) is used in (15) in order to 
avoid estimating the unpleasant bias term jj,2b'^Q'^{t)/2 of the SCB. 

An important observation of (15) and (16) is that 3in{t) does not depend 
on the observations (Xj) and has a simple and explicit form. Therefore, one 
can generate a large sample of i.i.d. copies of X„(t) and use the distributions 
of the and norms of the sample to approximate the distributions 
of the corresponding norms of Q{t) — Q{t) by virtue of (15) and (16). The 
following are the detailed procedures. 

(1) Choose bandwidth 6„ according to the procedures in Section 4.3. 

(2) Obtain Qa,{t) by (2) and (11). 

(3) Obtain estimate cr{t), f{t,Qa{t)) from (19) and (20) below. Let 7r(t) = 
f\tMt))/a^{t). 
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(4S) Generate i.i.d. standard normal random variables Vi, i = l,2,...,n. 

Calculate Wn,s = ^^Po<t<i\^n{t)\ ■ 
(41) Generate i.i.d. standard normal random variables Vi, z = l,2,...,n. 

Calculate Wn,i = /q X'^{t)dt. 
(5) Repeat (4) B times and obtain the estimated quantile gi-/? of Wn- 
(6S) The 100(1-/3)% SCBofQ„(t) can be constructed as ^^(i) ±91-/?/ \/^- 

Accept the null hypothesis Qa{-) = Qai') level 1 — /3 if and only Q^i') 

is fully contained in the SCB. 
(61) Accept the null hypothesis Qa{-) = Qai') level 1 — /3 if and only if 

Ji[Qa{t)-Q'^{t)]H{t)dt<q,.p. 

One should use steps (l)-(3), (4S), (5) and (6S) when performing hypothesis 
testing via the SCB. On the other hand, steps (l)-(3), (41), (5) and (61) 
should be adopted when testing via the ISDT. The number of replications 
B can be chosen as 2000. It is immediate to obtain p- values of the tests. For 
instance, the p- value of the squared difference test is ¥{wnj > /q [Qait) — 
Q° (i)]^7r(t) (it), which can be estimated by the bootstrap distribution of 

Let {f{t, 9)} be a parametric family of functions that depends on t G [0, 1] 
and 9 £ & C M'^. Often one wants to test Qa{t) = fit, 6) at level 1 - /3 for 
some unknown 9 £ Q. Under the null hypothesis, we have a parametric model 
and one generally expects to obtain a root-n consistent estimator 9 of the 
true parameter value Oq by the parametric quantile regression method of 
Keonker (2005). Note that the convergence rates of our SCB and ISDT 
tests are always slower than ^/n. Therefore, if the null is true, fit, 9) can 
be treated as the true value of Qait) and one just needs to replace Q^i') in 
steps (l)-(6) by fit, 9) and the resulting testing procedures are still valid. 

The following proposition validates the above procedure for the case 
{f{t,9)} = {9'^ git)}, where (9 e M'^ and g(t) : [0, 1] ^ M'^ is a known func- 
tion. We shall first make the following constraint on g: 

(Bl) assume g(-) G C[0, 1] and Q := g(t)g^ (t) dt is nonsingular. 

Proposition 1. Assume that Qait) = g(i) for some 9q £ M'^ and that 
conditions (Al), (A5) and (Bl) hold. Then 

(17) \9a-9o\=Op{n-'/^), 

where 9a = argmin^ Y17=i Pai^i - 9'^g{i/n)). 

4.2. Estimation of the density and long-run variance functions. We see 
from (15) and (16) that obtaining a good estimate of fit,Qit))/a{t) is nec- 
essary and important for making our inferences. Here, we suggest using the 
estimation techniques in Zhou and Wu (2009), which are essentially local 
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versions of the popular subsampling long-run variance estimator and kernel 
density estimator for stationary data. Since the time series is approximately 
stationary within comparatively small time spans, the methods are shown 
to be consistent. See Section 3.4 in Zhou and Wu (2009) for more details. 
For the sake of completeness, we present the estimators here. 

For tG(0,l), let s„(t) = max([nt — n6„J , 1), Z„(t) = min([nt + n6„J,n) 
and 

(18) Afn{t) = {ien:sn{t)<i<ln{t)}. 

Let Zi^a = '4'a{Xi — Qa{i/n)). For a sequence m„ with m„ — )• oo and nbn/mn — 
oo, we shall estimate (J^(t) by 



lnit)-mn+l ^v^j+m„-l y s 2 

(19) -'it) = -^-p^ ^-Znit)) 

\Mn{t)\-mn + l .^^^^ \ rUn ) 



where Z„(t) = Yl,i<^M^{t)'Zi,oil\l^n{t)\ and |7Vn(t)| = /„(*) - s^it) + 1 is the 
cardinality of Mn{t)- 

For f{t,Qa{t)), we shall use 

(20) f^tMt)) = jj^^-^ KliQa{t)-X,), 

where K"^ E /C is a kernel and is the bandwidth satisfying hn ^ and 
nbnhn — )• oo. 

We refer to Zhou and Wu (2009) for a discussion on the selection of the 
tuning parameters m„ and t„. 

4.3. Bandwidth selection. Choosing a good bandwidth bn{a) is impor- 
tant in practical applications. For quantile curve estimation, Zhou and Wu 
(2009) proposed the following way to choose the bandwidth based on modi- 
fications of existing bandwidth selectors for independent data. By Theorem 
1 of the latter paper regarding asymptotic normality of Qa(t), we have 



(21) 



fen.(Q) 



li'^Ht)dt 
a(l — a) 



1/5 



p*ia), 



where b'!^{a) denotes the optimal weighted asymptotic mean integrated 
squared error (AMISE) bandwidth, b}^'^{a) is the optimal bandwidth ob- 
tained under independence and p* (a) is called the variance correction factor 
which accounts for the dependence. Note that a{l — a) = Var( J(t, Qa{t),J^i))- 
For independent data, there have been many discussions on bandwidth selec- 
tion for nonparametric quantile estimation; see, for instance, Yu and Jones 
(1998), Fan and Gijbels (1996) and Ghosh and Draghicescu (2002) among 
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others. Hence, one could first select a bandwidth f)™*^(a) by treating the data 
as if they were independent. After that, the variance correction factor p*{a) 
can be estimated by the following: 

(^2)1/5 



where 




n — m + _ 

i=i 



= '>Pa{Xi - <3a,femd(„)(t)), ?„,q, = X^^^iQ.a/n and m = [n^/^J. It can be 
shown that p*{a) is a consistent estimate of p*{a) and we shall point out 
that the selected bandwidth h*^{a) typically varies with respect to a [see 
also Yu and Jones (1998)]. Moreover, since the jackknife bias reduction tech- 
nique reduces the bias of the local linear quantile estimates to second order, 
following Wu and Zhao (2007), we suggest using 6Jf^^(a) = 2bl{a) for the 
nonparametric estimation when the jackknife is implemented. We refer the 
interested reader to Sections 3.1.1 and 3.3 of Zhou and Wu (2009) for more 
details on the bandwidth selection. 

The bandwidth selected for quantile curve estimation provides a reason- 
able starting point for nonparametric tests [Fan and Jiang (2007)]. For the 
SCB test, we suggest using the bandwidth 6|f^ (a) following Eubank and 
Speckman (1993) and Wu and Zhao (2007). For the ISDT test T*, Corollary 
1 implies that the bandwidth which renders the optimal power is of order 
^-2/9 _ 1/5 1/45 _ Following Fan and Jiang (2007), we suggest using the 
bandwidth l^^^^ia) x n-^/^^ for the ISDT test. 

5. Simulation studies. In this section, we perform simulation studies to 
investigate the accuracy and power of the proposed tests for moderate sam- 
ple sizes. Let us consider the following time-varying AR(1) model 

(22) G{t,F^) = a^{t)+ai{t)G{t,F^^i) + 5{t)e,, 

where ao(*)i ai(i) and 5{t) are continuous functions on [0, 1], maxt]ai(f)] < 1, 
min^ 6{t) > and £{ are i.i.d. with \\£i\\p < oo for some p> 0. We observe the 
time series Xi = G{i/n,Fi), i = 1, 2, . . . ,n. 

It is clear that G{t,F'i) has the representation 

(23) G{t, F) = + 5{t) f;[ai(t)]%._,. 

The UGMC and local stationarity conditions of (22) can be easily verified 
by the results in Section 4.1 of Zhou and Wu (2009). 
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5.1. Accuracy of the SCB and ISDT tests. In this subsection, we describe 
a simulation study to compare the accuracy of the asymptotic and bootstrap 
tests for both hght tailed and heavy tailed processes. To this end, we shall 
use the time-varying AR(1) process (22) with ao(i) = 0, ai{t) = sin(27ri)/2 
and 5{t) = exp((t — 1/4)^). Consider the following two scenarios: 

(a) e,~ Af(0,l); (b) ~ 5qS(1.8). 

Here, 505(1.8) denotes the standard symmetric a stable distribution with 
index 1.8 which has the characteristic function exp(— |t|^'^), t G M. It is easy 
to show that the 5a5(1.8) distribution has mean and infinite variance. 
Therefore, scenarios (a) and (b) represent light tailed and heavy tailed pro- 
cesses, respectively. Elementary calculations show that under scenarios (a) 
and (b), 

'?°'"= [i-Mt)l1'/- '^°' 

where is the ath quantile of £i and v = 2 and 1.8 in scenarios (a) and 
(b), respectively. 

We shall compare type I error rates of the following six tests: the asymp- 
totic SCB test (AS) based on Theorem 1; the asymptotic ISDT test (AI) 
based on Theorem 3; the bootstrap SCB test (BS); the bootstrap ISDT 
test (BI); the asymptotic point-wise confidence band (PC) and the Bonfer- 
roni confidence band (BF) based on Theorem 1 of Zhou and Wu (2009). 
The Bonferroni confidence band is simply the point-wise confidence band 
at level (3/n, where (3 is the desired level. We generate time-varying AR(1) 
processes under scenarios (a) and (b) with length n = 300 and perform the 
above six tests at the nominal level 5% for the following four quantile curves 
a = 0.5, 0.75, 0.9 and 0.95. Bandwidths are chosen according to Section 
4.3 and the critical values go. 95 of the bootstrap tests are based on 2000 
bootstrap samples. The simulated type I error rates with 1000 replicates are 
shown in Table 1 below. 

It is clear from the output of Table 1 that point-wise confidence bands 
are not appropriate for nonparametric inference. The Bonferroni confidence 
band test is too conservative, namely the band is too wide. On the other 
hand, the asymptotic SCB test (AS) is conservative and the asymptotic 
ISDT test (AI) tends to slightly inflate the type I error. As discussed in 
Section 4.1, the loss of accuracy of the asymptotic tests is due to their slow 
convergence rates. 

For the bootstrap tests, the nominal type I error is achieved except for 
extreme quantiles of heavy tailed processes. It is not difficult to see that 
data is relatively sparse at high quantiles and very large jumps occur more 
frequently for heavy tailed processes. These facts suggest that for extreme 
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quantile inference of heavy tailed processes, a relatively large sample size is 
needed in order to achieve the desired accuracy. We performed the bootstrap 
tests at 5% level for scenario (b) for n = 500 and with 1000 replicates. Simu- 
lated type I errors of the BS and BI tests were 0.061 and 0.056, respectively, 
for the 90% quantile curve. However, for the 95% quantile, the corresponding 
simulated type I errors were 0.139 and 0.131, respectively, which were still 
way larger than the nominal. When n was increased to 1000, accuracy of 
the bootstrap tests were achieved for the 95% quantile curve under scenario 
(b). 

5.2. Power comparison of the SCB and ISDT tests. As discussed in Re- 
mark 2, for a moderate sample size, power of the SCB and ISDT tests 
are greatly influenced by the shape of Qa{t) — Q°a{i)- Recall that Q°a{t) 
is the hypothesized value of Qa{t)- In this subsection, we present a sim- 
ulation to compare the power performance of the above two tests under 
various shapes of Qa{t) — Q°a{t)- To this end, we consider model (22) with 
aoit) = Lp{t){l-ai{t)), ai(t) =sin(27rt)/2, = exp((i - 1/4)2) ^. j j j_ 
iV(0,l). Then 

(25) Q^{t) = ^{t) + Q-^{t) where Q° (t) = — Q^. 

We test the hypothesis Ho:Qa{-) = Qai') versus Ha'-Qai') 7^ Qai')- Con- 
sider the following two situations: 

(i) ip{t) = ci; (ii) ifit) = C2 exp(-C3(t - 1/2)^), 

where q, i = 1,2,3, are positive constants. Cases (i) and (ii) correspond to 
flat and bumpy differences of the true and hypothesized quantile curves, 
respectively. Note that as C3 gets larger, we observe sharper peaks in (p{t). 
In our simulations, we follow steps (1) to (6) in Section 4.1 and perform the 



Table 1 

Simulated type I error rates for the six tests with nominal level 5% under scenarios (a) 
and (b) . Series length n — 300 with 1000 replicates 



Test 


a = 


0.5 


a = 


0.75 


a = 


0.9 


a = 


0.95 


(a) 


(b) 


(a) 


(b) 


(a) 


(b) 


(a) 


(b) 


AS 


2.2% 


2.6% 


2.7% 


2.8% 


2.1% 


9.9% 


2% 


21.7% 


AI 


8% 


6.5% 


7.2% 


6.9% 


7.4% 


10.4% 


6.7% 


21.4% 


BS 


4.8% 


6% 


4.4% 


5.4% 


4% 


10.4% 


4% 


23.8% 


BI 


5.6% 


5.3% 


5.2% 


5.2% 


5.2% 


9.3% 


5.5% 


20.2% 


BP 


1.5% 


0.8% 


1.7% 


1.6% 


1.3% 


5.5% 


0.2% 


14.1% 


PC 


61.3% 


61% 


53.9% 


53.1% 


45.4% 


40.8% 


45.7% 


44.2% 



18 



Z. ZHOU 



SCB and ISDT tests at 5% level with n = 300 and a = 0.5, 0.75, 0.9 and 
0.95. Various values of ci and (02,03) are investigated and for each of the 
values we perform 1000 replicates and record the simulated probability of 
rejecting the null hypothesis. The simulated power curves of case (i) and (ii) 
are shown in Figures 1 and 2, respectively. 

Generally, the displays in Figures 1 and 2 are consistent with our ar- 
guments in Remark 2 that for a moderate sample size the ISDT test is 
more powerful when (p(-) is flat and the SCB test performs better when ip{-) 
changes abruptly. On the other hand, when the peak of ip{-) is not sharp 
enough; namely when C3 is relatively small, we see from Figure 2 that the 
ISDT test is still more powerful than the SCB test, which can be explained 
by the fact that the ISDT test asymptotically dominates the SCB test. 

6. The global tropical cyclone data. One of the most important conse- 
quences of global warming is the increase of ocean temperature. Theoret- 
ical arguments and modeling studies indicate that tropical cyclone winds 
should increase with increasing ocean temperature [Eisner, Kossin and Jag- 
ger (2008)]. Meanwhile, climatologists are very interested in finding empirical 
evidences on the change of intensity of tropical cyclone winds. In a recent 
paper, Eisner, Kossin and Jagger (2008) tackled the latter problem partially 
by fitting linear trends for quantiles of the global tropical cyclone data. The 
data set contains satellite-derived lifetime-maximum wind speeds of 2098 
tropical cyclones over the globe during the period 1981-2006. It is avail- 
able at James Eisner's website at http://myweb.fsu.edu/jelsner/extspace/ 
globalTCmax4.txt. We shall refer to Eisner, Kossin and Jagger (2008) for 
a detailed description on how the data are obtained and the related issues. 
Figure 3 shows the time series plot of the data. Significant linearly increasing 
trends were found in high quantiles of the global tropical cyclone data in Ei- 
sner, Kossin and Jagger (2008). In other words, the worst tropical cyclones 
are getting stronger over the globe. 

In this section, we are mainly interested in the following issues. First, 
we shall compare our quantile-based tests with the mean-based approaches 
to see whether the increase of intensity of the worst tropical cyclones is 
due to a shift of mean. Second, we shall test whether the linear trend used 
in Eisner, Kossin and Jagger (2008) is sufficient to describe the change of 
the high quantiles of the global cyclone winds. This is an important issue 
because the parametric linearity assumption implies homogeneous change of 
the wind speeds. However, much complex yet important information on the 
dynamics of high wind speeds may be buried under this assumption. 

To address the first issue, we follow the procedures in Zhou and Wu 
(2010) and provide a SCB for the mean curve of the global tropical cyclone 
series. The bandwidth is chosen as 0.16. The left panel of Figure 4 shows 
the 95% SCB. The height of the horizontal line is the average of all cyclone 
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Power curve for the median Power curve for tfie 75% quantile 




0.1 0.2 0.3 0.4 02 0.3 04 0.5 




0.3 04 0.5 0.6 0,4 0.5 6 0.7 



Cl Cl 

Fig. 1. Power curves for the 50%, 75%, 90% and 95% quantiles under case (i) of Section 
5.2. The solid lines are the power curves for the ISDT test and the dashed lines are the 
power curves for the SCB test. 



Power curve for the median, c2=1 Power curve for the 75% quantiie, c2=1 .5 




Fig. 2. Power curves for the 50%, 75%, 90% and 95% quantiles under case (ii) of Section 
5.2. The solid lines are the power curves for the ISDT test and the dashed lines are the 
power curves for the SCB test. 
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wind speeds. It is clear that the horizontal line is fully contained in the 
SCB. The p-value for testing constancy of the mean trend is 0.11. Therefore, 
statistically there is no trend in the average tropical cyclone speeds. In fact, 
one of the main reasons why scientists did not find interesting signals in the 
intensity of global tropical cyclone winds before Eisner, Kossin and Jagger 
(2008) is due to the focus on mean trends. 

We performed the SCB and ISDT tests on the linearity of the 90% quan- 
tile curve to address the second issue. The bandwidths of the SCB and ISDT 
tests are chosen as 0.21 and 0.18, respectively. Both tests give p- values less 
than 0.001. The right panel of Figure 4 shows the 95% SCB. It is clear from 
the SCB that there is an inhomogeneous increasing trend of the 90% quan- 
tile curve. More specifically, the quantile curve underwent a sharp increase 
during the period November 1982 to September 1992, and after that the 
trend became flat. This information cannot be retrieved if the linear trend 
analysis in Eisner, Kossin and Jagger (2008) is adopted. Furthermore, the 

Tropical Cyclone Lllcllme-maximum Wind 
Speed {1981-2006} 



Fig. 3. Time series plot of tropical cyclone lifetime-maximum wind speed (1981-2006). 



Mean Curve 90% Quantile Curve 




Tropical Storm Number Tropical Storm Number 



Fig. 4. Trends in mean and 90% quantile of global tropical cyclone lifetime-maximum 
wind speed, with 95% simultaneous confidence hands. 
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inhomogeneous trend in high quantiles provides chmatologists with useful 
information on the underlying complex mechanisms producing the hurricane 
climate. 



7. Proofs. Unless otherwise specified, we will only prove results for a = 
1/2, since results for other quantiles follow similarly. We shall also omit the 
subscript a in the notation if no confusion will be caused. 

Consider a system H{t,x,J^i) defined in Section 2.2. Let ^r: [0,1] i— ?• R be 
a measurable function. Define S){t^Fi) = H{t,g{t),Fi). Then Zi := S^{ti,Fi) 
i = 1,2, ... ^n, defines a nonstationary time series. Recall ti = i/n. Let Sz{i) = 
Yl^j=i ■^j-> ^ = 1,2, . . . ,n. The following invariance principle for {Zi)^^^ plays 
an important role in both establishing the asymptotic theory of the testing 
methods and justifying the wild bootstrap procedures. 

Theorem 4. Assume that (i) Eio(t, J'j) = for all t G [0,1]; that (ii) 
sup(j^a;)e[o,i]xRll-f^(*)^5-^i)ll4 < foT some finite constant cq; that (iii) H 
satisfies UGMC(4) and that (iv) there exists q > 1/4, such that for all i>0 

(26) \\Vo{Msi,T^)-^j{s2,T^)}\\<C\si-S2\'' 

holds for all si,S2e [0,1], where Vk{-) = 'E{-\J='k) -E{-\Tk^i) for k>0. Then 
on a richer probability space, there exists i.i.d. standard normal random 

variables {Vi)i and a process S'^{i) with {S'^{i)}f^i = {Sz{i)}i'=i, such that 



(27) max 



i<n 



: 0^(77,^/" log^n). 



where v{s) = [Y^ 1^^^ cov {H {s, g {s), Tq), H{s,g{s),J'k))Y^^ , < s < 1. 

Proof. Note v{s) = ||7^oEfcio^(S'-^fc)ll- Let S^{t,I^^) = if t > 1. By 
Corollary 1 of Wu and Zhou (2010), it follows that under conditions (i) and 
(ii), there exist i.i.d. standard normal random variables (V^)" and a process 



5^(i) with {S°z{i)}2=i = {Sz{i)}i=i, such that 



(28) max 



i<n 



.(nV4log3/2 



n 



where vj = ||7^o Z]fclo^(*i + ik,^k)\\- Based on (26), 

(29) \\vo{m,^k)-m+tk,n)}\\<ti 

for < k < n — i. On the other hand, by Theorem 1 of Wu (2005), we obtain 

(30) \\Vo{m,^k) - m + tk,J'k)}\\ <26Hik,2) 
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for all A; > 0. By (29), (30) and the fact that 5h(A;,2) < SnikA) = 0{x^) for 
some < X < 1) we have for all 1 < j <n — [n^/^] , 

oo 2 



{v{i/n) -Vjf < 



k=0 



< [J2\\'Pomtj,Tk)-S)itj+tk,Tk 



\k=0 
t k* 



\k=0 



k=k* 



where k* = [— logn/logxj . Note for n — [n^/^] < j <n, {v{j/n) — Vj)"^ 
0(1). Therefore, ^"^^(f (j/n) - t)^)^ = 0(ni/2 log^/^ ^). Hence, 



max 

i<n 



(31) 



Op((ni/2 log5/2^)V2iogV2„) 



.(n^/^log 



n) 



By (28) and (31), Theorem 4 follows. □ 

Remark 3. The Gaussian approximation in Theorem 4 shows that par- 
tial sums of a short range dependent (SRD) locally stationary process can be 
approximated by weighted sums of i.i.d. standard normal random variables. 
We shall call v'^{t) = '^j^^j^cov{S^{t,FQ),9){t,Fk)) the long-run variance of 
the system {^{t,Fi)} at time t. The weight v{t) captures the local depen- 
dence of the series {Zi) at t; while the fluctuation of v{t) on [0, 1] is due to 
the nonstationarity of the series. Following the arguments in Wu and Zhou 
(2010), the bound op(n^/^ log^ n) in (27) is optimal within a multiplicative 
logarithmic factor. 



We shall state and prove the following Lemma 1 and Lemma 2 before we 
proceed to the proof of Theorem 1. 

Lemma 1. Assume Q{t) gC^[0, 1]; that conditions (Al)-(A6) and (Kl) 

3 /2 

hold, and that bn^O with nbn oo, then we have 

snp\f{t,QmQ{t)-Q{t)]-Sn{t)-ES:Xt)\ 
teTn 
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(32) Sn{t) = Y,'^{Xi - Q{U))Kb„{t - U)/{nhn) 

1=1 

with ipa{x) = a — I{x < 0}, and 

n 

S*n{t) = Y.'^{Xi - Q{t) - {ti - t)Q'{t))KbAt - ti)/{nbn). 



i=l 



Proof. A careful check of the proof of Theorem 3 of Zhou and Wu 
(2009) shows that under conditions of Lemma 1 



(33) 



sup|/(t,g(t))[Q(t)-Q(t)]-5:(t)| 



1/2 1 



Let ^(i, t) = iP{Xi - QiU)) - iP{Xi - Q{t) - (U - t)Q'(t)), 

n 

Mn{t) = J2 *)^b„ {t - ti)/{nb„), 



1=1 

n 



Nn{t) = ^{E[V(i, t)\Ti.i] - Ei;{i, t)}Kb„ {t - ti)/{nbn). 



1=1 



Note the summands of Mn{t) form a triangular array of martingale differ- 
ences and Nn.{t) is differentiable with respect to t. Using similar chaining 
arguments as those in the proof of Lemmas 5 and 6 in Zhou and Wu (2009), 
we have 



(34) sup |M„(t)| =Off 
te[o,i] 



ftnlogn 
\/nbn 



sup \Nn{t)\=Of 
t6[0,l] 



(,3/4 

fnb^ 



Note Sn{t) - (5*(t) -E5*(t)) = Mnit) + Nnit). Therefore, by (33), (34) and 
the fact that 6nlogn = o(6n^^), we show that Lemma 1 holds. □ 

Lemma 2. Let {Vi} be i.i.d. standard normal random variables. Assume 
condition (Kl) and bn — )• with n6,j/(logn)^ — )■ oo. Then for any x G M 



lim I 



1 



sup 

n taTn 



Y^V,K,^{ti-t) 



i=l 



B{m*) < 



\/21og m* 



-2e- 
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Lemma 2 follows from classic results for extremes of Gaussian processes. 
See, for example, Bickel and Rosenblatt (1973) and Lemma 2 of Wu and 
Zhao (2007). Details are omitted. 

Proof OF Theorem 1. Consider the system {1/2- J(t,2;,J'i)}, (t,x) G 
[0,1] X M. Recall J{t,x,Ti) = I{G{t,Fi) < x}. Let Z{t,Ti) = 1/2 - J(t, 
Q{t),J^i). We shall first show that Z{t,J'i) satisfies conditions of Theorem 
4. Obviously KZ{t,Fi) = and ||l/2 - J{t,x,Fi)\\i < 1 for ah t G [0, 1] and 
X G M. Based on (A5), condition (iii) of Theorem 4 holds. We now check (iv). 
Note for any f > 

\\VQ{Z{si,Fi) -Z{s2,:Fi)}\\ < \\Z{si.:F,)-Z{s2,Ti)\\ 

(35) 

= ||J(S1,Q(S1),J-,)- J(S2,Q(S2),-F.)||. 

Based on (A6) and the smoothness condition on Q(t), 

(36) \\J{s2.Q{si),Ti) - J{s2,Q{s2),m\ < C\si-S2\^'^. 

On the other hand, 

(37) \\J{si,Q{si),Ti)-J{s2,Q{si),Ti)\\<I + II, 

where /= ||[J(si,g(si),J-,)- J(s2,Q(si),J-i)]/{|Ci(5i)-0(s2)| <<5}|| and 
// = {|Ci(si) - Ci{s2)\ > 5}\\. Recall Qit) = G{t,Ti). Using condition (A6), 
we have for all (5 > 0, 

/ < \\I{Q[si) - 6 < Usi) < Q(si)}|| + \\I{Q{si) < Usi) < Q(si) + <^}|| 

(38) 

By condition (A3), there exists q>l, such that 

(39) „<iic.(^i)-;.fe)iir,^k-gK^ 

Let 6=\si- S2|''/(''+^), then (37), (38) and (39) imply that 

(40) \\J{si,Q{si),Ti) - J{s2,Q{si),T.i)\\ <C\si - S2\', 

where s = 2{i+g) — i- (^6) ^^^'^ (40), we have condition (iv) of The- 

orem 4 holds. Therefore, Theorem 4 implies that there exist i.i.d. standard 
normal random variables (1^)", such that 



(41) max 



J = l 
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where S^{i) = Yl\=i-3i'tk,J^k)- Recall a{t) was defined in (4). Define 

n 

(42) En{t)=Y,^{t^)V^Kh^{U - t)/{nbn), 



i=l 



(43) a„(i) = K,^{h -t) + Y, \KbAt^ -t)- K,Sh-i -t)\\ (nbn). 



1=2 



By the summation by parts formula and (41), we obtain 



sup|S'n(i) — H„(t)| < supr2„(i)max 



(44) 



n 



l/4log2 



n 



nbr, 



By the Lipschitz continuity of a{t) in (A2), it is easy to see that 

'6.„logn' 



(45) 



sup|H„(i) -H*(t)| =Of 



where ^nit) = <^it)I17=iViKbAU - t)/{nbn). By Lemma 1, (44), (45) and 
the fact that 6„log?i = o{bn^), it follows that 



(46) 



snp\f{t,QmQ{t)-Qit)]-EUt)-ESUt)\ 



1/2 1 , 1,3/4 

\fnK, 



+ briT^n + 7r„ + 



n 



l/4log2 



n 



nbr. 



It is easy to check that under bandwidth conditions of Theorem 1, the right- 
hand side of (46) is of order op((n6„logn)^^/^). On the other hand, by 
condition (Al) and a Taylor expansion, we have 



sup|E5:(t) - ii2f{t,Q{t))Q"{t)hM =0[bi + ^ 



nbr 



(47) 

= o{{nbn\ogn) ^/'^). 
By Lemma 2, (46) and (47), Theorem 1 follows. □ 



Let 



Sn{t) = Y,i^{X, - Q{ti))Kl{t - U)/{nbn), 



(48) 



i=l 



Kit) = Y,^{X, - Q{t) - {U - t)Q'{t))Kl^{t - ti)/(n6„). 



i=l 
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2)n(t) = Qit) - Q{t) - ES?,it)/fit, Qit)). 



We shall introduce several lemmas before proceeding to the proof of Theorem 
3. 

Lemma 3. Under the conditions of Theorem 3, we have 

(50) j ^n{t)7T{t)dt^ n(^0, ZU^{t)dt^, 

where zu(t) = 7r{t)a{t)/ f{t,Q{t)) . 



Proof. By Lemma 1, we have 
(51) 



sup|/(t,Q(i))2)„(t)-5„(t)| 

ter* 



1/2 1 , 1,3/4 

TTn' logn + V 2 1 

— + bn-^n + 7r„ . 



'nbr, 



On the other hand, arguments similar to those in the proof of (44) and 
(45) lead to 



(52) 
where 

(53) 



sup|5„(t) - a{t)Xnit)\ 



n}!^ log^ n log n 



nbr, 



'nbr. 



^n(^) = X^W„(^^-^)/(nM• 



Recall (Vi)" are i.i.d. standard normal random variables defined in (41). 
Hence, by (51), (52) and the bandwidth conditions of Theorem 3, it follows 
that 



(54) 



sup 



2)n(t) 



ait) 



f{t,Q{t)) 

Furthermore, it is easy to obtain that 



.(n-V2) 



(55) j Xri.{t)w{t)dt^ n(^0, w'^{t)dt 

By (54) and (55), this lemma follows. □ 
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Lemma 4. Recall (13) for the definition of Tx*{t) and (53) for the defi- 
nition ofXnit). We have 



7ly/bn I Xn{tf7T*{t)dt 



1 

-^/r*/r(o) / TT*{t)dt 

vK Jo 



n(o,2 j [K* i.K*{t)fdt TT*{tfdt^. 



Proof. Define 



In = E^' / [^6„(i-i)]'vr*(t)dt/(n53/2), 
1=1 ■^T'n 

iin= Yl R'^Ri I Ki{U-t)KlSh-t)^*ii)di/i^^T)- 

Then by the central hmit theorem for /„, it fohows that 

/„ = ^K**K*(0) [\*{t)dt + oJ^ + ^=] 
Vbn Jo \nhyl VnbnJ 



(56) 



1 

/bn Jo 



On the other hand, by Theorem 2.1 of de Jone (1987), elementary but tedious 
calculations show that 



(57) 



Iln^N (^0,2 J [K*i.K*{t)fdt TT^tfdt^. 



bmce n 

y/K Jr* ^n{tfT^*{i) dt = In + Hn, by (56) and (57), the lemma fol- 
lows. □ 

Proposition 2. Let fn = f^, 2)^(t)7r(t) dt. Then under the conditions 
of Theorem 3, we have 



1 



n^bnTn- ^K**K*{0) / TT*{t)dt 



(58) 



n(^0,2 j [K* i.K*{t)fdt TT*{tfdt^. 



(59) 



Proof. By (51) and (52), we have 

a{t) 



sup 



2)n(t) 



f{t,Q{t)) 
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where Vn 



On the other hand, it is easy to show that 



(60) 



sup |X„(t)| = Of 
t&T* 



Therefore, (59) and (60) imply that 



(61) 



sup|2)„(t)| =Op 

ter* 



log^/^ n 



log-*^/^ n 
\/nhr, 



Hence, by (59), (60), (61) and bandwidth conditions of Theorem 3, 



Tn - I [Xn{t)YT^*{t)dt 



(62) 



< C sup I A„(t)| sup 

tdT' teT' 



|2)n(t)| + 



Xn{t) ]TT{t)dt 

a{t) 



f{t,Q{t)) 



(t) 



log^/^ n 



1 



where A„(t) = 2),i(t) — j^^^^j^jC„(t). Therefore, by Lemma 4, we ihustrate 
that Proposition 2 holds. □ 

Proof of Theorem 3. First, note 

Q{t) - Q°{t) = 2)„(t) + E5°(t)//(t, Q{t)) + QMt) + 0(1)). 
Therefore, 



(63) 



T* = r„ + / [EcS°(t)//(t, Q(t)) + QnHt) + o(l))]^7r(t) 

+ 2 / 2)„(t)[E5°(t)//(t,Q(0) + en(7?(t)+o(l))]7r(t)a!t 



■ —T -\- T* -\- Tl* 
Since (5(t) G C^[0, 1], simple calculations show that 

1 



(64) 



snv\&S"^{t)\=o{hl) + 0[ — 
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By the bandwidth conditions of Theorem 3, it follows that 
(65) II - qI [ r,\t)T,{t) dt = o{bi + blQn) = o{qI). 

By Lemma 3, 

2)„(t) [r7(t) + o(l)]^(t) dt = Op(^„n-i/2) 

= op(n-i(6„)-'/'). 



(66) 



Similarly, 

(67) / 2J„(t)[E5°(t)//(t,Q(t))]7r(t)dt = op(n-i(6„)~'/') 



T* 

Hence, Proposition 2, (63), (65), (66) and (67) imply that Theorem 3 holds. 
□ 

Remark 4. Under the null hypothesis Q{t) = Q°{t), we see from the 
above proof that the bias Bn{t) influences the asymptotic behavior of T* 
through two terms f^* B'^{t)Tr{t) dt and 2).„(t);B„(t)7r(t) dt, where Bn{t) = 
KS°{t)/ f{t,Q{t)). The jackknife bias reduction technique reduces those two 
effects to second order. However, it can be shown that if the original esti- 
mate Q{t) is used in the test, then the first term is not negligible under 
the optimal bandwidth 6„ = 0(n~^/^), which complicates the asymptotic 
analysis and reduces the precision of the test. 

Proof of Proposition 1. Proposition 1 follows from Lemmas 5 and 
6 below. □ 

Lemma 5. Let Ln = Y17=i ^(^i)g(*i)- Then under the conditions of Propo- 
sition 1, we have Ln = Of>{^/n). 

Proof. Let Ln^k = '^^=i'Pi-k'4'{ei)s{'ti)- Recall the operator is de- 
fined in Theorem 4. Then the summands of L„ ^ are martingale differences. 
By the orthogonality, we have for k>l 

n n 

\\Ln,kf = ll^^-fcV'(e^)g(^^)f = ^ | g(t») | ' ri-fc^(ei) f 
i=l i=l 

<CnSl{k- 1,2). 

Similar arguments also imply ||L„ < Cn. Therefore, 

oo / oo \ 

||L„|| = ^||L„,fc|| <Cy^ l + ^5i.(A:-l,2) <CV^. 

k=0 \ k=l ) 
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Hence, Lemma 5 follows. □ 

Lemma 6. Under the conditions of Proposition 1, we have 

n 

y-ada - y^V^a(ei)g(ti)/\/n = Op(l), 



1=1 



where 6a = \/n{9a — 9o), ei = Xi — QaiU) and 

-1 



Ha 



git)g' {t)f{t,Qait))dt. 







Proof. We shall omit the subscript a in the proof. Let gn{t) = g{t) / ^Jn. 
By argmnents similar to those of Lemma 3 of Wu (2007), we have for any 
fixed constant c and fixed < c, 



(69) 



varK^,?,(0) =o(l), 



.i=l 



where r]i{9) = p{ei - gZ{ti)6) - p{ei) + g^{ti)9ip{ei). On the other hand, 
elementary calculations based on the Taylor expansion show that 



(70) 



i=l 



^^^^4^\snXm'+oi\g:it,)9\^) 



1=1 

9'^n9 



+ 0(1). 



From equations (69), (70) and the convexity lemma [Pollard (1991), page 
187], we obtain 



(71) sup 



^[p(e, - gl{ti)9) - p{ei) + gl{ti)9ij{e. 



9^n9 



i=l 



Op(l). 



Now a standard argument using properties of convex functions will lead to 
(68). See, for example, the proofs of Theorems 2.2 and 2.4 in Bai, Rao and 
Wu (1992). Details are omitted. □ 
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